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(57) A database computer system includes memo- 
ry, residing in a plurality of interconnected computer 
nodes, for storing database tables. Each database table 
has a plurality of columns, a primary key index based 
on a specified subset of the columns, and an associated 
table schema. At least a subset of the database tablos 
are partitioned into a plurality of partitions, each partition 
storing records having primary key values in a prima-y 
key range distinct from the other partitions. Atransaction 
manager generates and stores an audit trail, each audit 
entry denoting a database table record event, such vs 
an addition, deletion or alteration of a specified data- 
base table record in a specified one of said database 
tables. Four online data definition procedures allow the 
structure of a database table to be altered while the da- 
tabase table remains available to execution of transac- 
tions, with minimal impact of the availability of the data- 
base table for transaction execution. The four online da- 
ta definition procedures are a move partition procedure, 
a split partition procedure, a move partition boundary 
procedure, and a create new index procedure. Each of 
these online procedures has three or four phases of ex- 
ecution. In a first phase, records of a table partition or 
the entire table are accessed using read only access, 
so as to generate a new partition, move records b-3-" 
tween two partitions, or to create a new index. In a sec- 
ond phase, audit trail entries are used to clean up the 
data structures created during the first phase. In a third 
phase, access to the database table is briefly locked 
while audit trail entries created after the second phase 
are used to make final changes to the data structures 



created during the first phase, and while the database 
table schema is updated to reflect the changes to the 
database table produced. 
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Description ' ; 

The present invention relates generally to database management 'systems, and particularly to a database man- 
agement system in which database table availability is maintained, with minimal or no user availability outages during 
5 table restructuring operations such as splitting a table or index partition, moving an existing table or index partition, 
creating a new index and moving a table or index partition boundary. - : 

BACKGROUND OF JHE INVENTION f . , 

10 Database configuration and reconfiguration operations can have a significant effect on the availability of user 

applications that need access to databases undergoing structural changes. The Tandem™ NonStop™ SQUMP rela- 
tional database management system (DBMS), prior to the present invention, allowed read access, but not write access, 
to the portions of the database table undergong the restructuring operation. 

Although most users perform these operations infrequently, their duration can account for thousands of minutes 

is of application outages per year. A discussion of the cost of application outages appears in the article "An Overview of 
NonStop SQUMP," Ho et al., Tandem Systems Review, July 1994. 

The present inventbn eliminates most of the downtime associated with four database reconfiguration operations: 
Move Partition, Split Partition, Move Partition Boundary, and Create Index. The Move Partition procedure moves a 
partition that resides on one disk to another disk. The Split Partition procedure splits one partition into two. The Move 

20 Partition Boundary procedure moves rows of a base table between acijacent partitions, typically shifting rows from a 
large partition to a less large partition. The Create Index procedure creates an efficient alternate access path to a 
database table by ordering data according to the value specified in the key columns. The Create Unique Index variant 
of this procedure ensures that there is exactly one alternate access pain to each record in tile database table. 

The implementation of these procedures in the present invention substantially reduces, but does not eliminate, 

25 associated outages. Even with the present invention, user database activity continues to be restricted for about one 
minute or less per database restructuring operation. The outage time varies depending on the number of user trans- 
actions running against the table being restructured, the size of those transactions, and the number of partitions in the 
affected table 

It is therefore a primary object of the present invention to enable substantially improved user transaction access 
30 to a database table white the table is undergoing a structural change operation. 

Another object of the present invention is to enable Move Partition, Split Partition, Move Partition Boundary, and 
Create index operations to be performed on a database table while enabling user transactions to continue to be per- 
formed on the table, except during a short final phase of those operations. 

Another object of the present invention is to avoid the use of "side files" for the above mentioned database recon- 
35 figuration operations, so as to reduce the overhead disk space requirements associated with these operations to almost 
zero. 

Another object of the present invention is to implement the above 'mentioned database reconfiguration operations 
using procedures that directly read from the transaction audit log, thereby making these database reconfiguration 
operations similar to a database recovery process. 
40 Another object of the present invention is to implement the database reconfiguration operations using well estab- 

lished, optimized, database recovery process procedures thereby making the database reconfiguration operations 
efficient in terms of computation time and in terms of the computer resources used. ' { 

Yet another object of the present invention is to utilize substantially similar database reconfiguration procedures 
for four distinct database reconfiguration operations, thereby improving the reliability of those procedures and simpli- 
45 tying maintenance of and updates to those procedures. 

SUMMARY OF THE INVENTION ' 

In summary, the present invention is a set of procedures for modifying the structure of a database table or index 
so "online," while the database table or index remains available for execution of transactions, with minimal impact on the 
availability of the database table for transaction execution. 

The present invention operates in a database computer system having memory, residing in a plurality of intercon- 
nected computer nodes, for storing database tables. Each database table or index has a plurality of columns, a primary 
key index based on a specified subset of the columns, and an associated table schema. In most implementations, at 
55 least some of the database tables or indexes are partitioned into a plurality of partitions, each partition storing records 
having primary key values in a primary key range distinct from the other partitions. 

A transaction manager generates and stores an audit trail, each audit entry denoting a database table or index 
record event, such as an addition, deletion or alteration of a specified database table or index record in a specified 
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one of the database tables or indexes. 

Four online data definition procedures allow the structure of a database table or index to be altered while the 
database table or index remains available for execut'on of transactions, with minimal impact of the availability of the 
database table or index for transaction execution. The four online data definition procedures are a Move Partition 

s procedure, a Split Partition procedure, a Move Partition Boundary procedure, and a Create Index procedure. Each of 
these online procedures has several phases of execution. In a first phase, the definitions of existing objects are read 
and any new objects that are needed to perform the requested procedure are created. Then, records of a table or index 
partition or the entire table are accessed, using "browse" access, so as to generate a new partition, to move records 
between two partitions, or to create a new index. Browse access is a form of read access that allows database records 

10 to be read through any record locks that may be in place. Thus, browse access will sometimes result in the reading of 
records that are in the midst of being modified. 

In a second phase, audit trail entries are accessed and the equivalent database table or index operations denoted 
in those audit trail entries are redone on the target objects, whenever necessary, to bring the data records created 
during the first phase up-to-date. In a third phase, access to the database table is briefly locked (A) while audit trail 

is entries created after the second phase are used to make final changes to the previously created data records : and 
then (B) while the database table or index schema is updated to reflect the changes to the database table or index 
produced. 

If a fourth phase, used by the Move Partition Boundary and Split Partition procedures, records in a database" 
partition that are inconsistent with the modified database table schema are deleted as a background operation while 
w use of the database table by transactions resumes. 

BRIEF DESCRIPTION OF THE DRAWINGS 



Additional objects and features of the invention will be more readily apparent from the following detailed description 
2S and appended claims when taken in conjunction with the drawings, in which: 

Figure 1 is a block diagram of a computer system having a database management system in accordance with the 
present invention. ; 

Figure 2A is a block diagram of a database table. Figure 2B is a block diagram of an alternate index. Figure 2C is 
a block diagram of the data structure of an Audit Traii. 
30 Figure 3 is a conceptual diagram of the three primary phases of the database restructuring procedures in the 

present invention. 

Figure 4 is a flowchart of the Move Partition procedure used in a preferred embodiment of the present invention. 
Figure 5 is a flowchart of the Split Partition procedure used in a preferred embodiment of the present invention. 
Figure 6 is a flowchart of the Move Partition Boundary procedure used in a preferred embodiment of the present 
35 invention. 

Figure 7 is a flowchart of the Create Index procedure used in a preferred embodiment of the present invention. 
DESCRIPTION OF THE PREFERRED EMBODIMENTS 

40 Referring to Figure 1 , there is shown a computer system 1 00 for storing and providing user access to data in stored 

databases. The system 100 is a distributed computer system having multiple computers 102, 104, 106 interconnected 
by local area and wide area network communication media 108. The system 100 generally includes at least one da- 
tabase server 102 and many user workstation computers or terminals 104, 106. 

When very large databases are stored in a system, the database tables will be partitioned, and different partitions 

45 of the database tables will often be stored in different database servers. However, from the viewpoint of user workstation 
computers 104, 106, the database server 102 appears to be a single entity. The partitioning of databases and the use 
of multiple database servers is well known to those skilled in the art. , 

As shown in Figure 1 , the database server 102 includes a central processing unit (CPU) 110, primary memory 112, 
a communications interface 114 for communicating with user workstations 104, 106 as well as other system resources 

so not relevant here. Secondary memory 116-1,11 6-2 : typically magnetic disc storage, in the database server 1 02 stores 
database tables 120, database indices 122, a database management system (DBMS) 124 for management of the 
database tables and associated data structures and resources, and one or more catalogs 126 for storing schema 
information about the database tables 120 as well directory information for programs used to access the database 
tables. The DBMS 124 includes an SQL executor 128 for executing SQL statements (i.e., database queries) and an 

ss SQL catalog manager 1 30 for maintenance of the catalogs 1 26 and for performing database definition and restructuring 
operations. The SQL catalog manager includes the "online DDL procedures*' 132 of the present invention for restruc- 
turing database tables while providing improved user transaction access to the affected tables. 

The database server 102 further includes a transaction manager 134 for managing transactions, and application 
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programs 136 that are utilized by users to perform transactions that utilize the database tables 120. The transaction 
manager creates audit entries for each transaction, which are durably stored in an audit trail file 138 in secondary 
memory. 

End user workstations 104, 106, typically include a central processing unit (CPU) 140, primary memory 142, a 
5 communications interface 1 44 for communicating with the database server 1 02 and other system resources, secondary 
memory 146, and a user interface 148. The user interface 148 typically includes a keyboard and display device, and 
may include additional resources such as a pointing device and printer. Secondary memory 146 is used for storing 
computer programs, such as communications software used to access the database server 1 02. Some end user work- 
stations 106 may be-'dumb" terminals that do not include any secondary memory 146, and thus execute only software 
io downloaded into primary memory 142 from a server computer, such as the database server 102 or a file server (not 
shown). 

Glossary 

is To assist the reader, the following glossary of terms used in this document is provided. 

SQL: SQL stands tor "Structured Query Language." Most commercial database servers utilize SQL. Any program 
for accessing data in a database that utilizes SQL is herein called an "SQL Program." Each statement in an SQL 
program used to access data in a database is called an "SQL statement." 

Object(s): An object is a file, database table or other encapsulated computer resource accessed by a program as 
20 a unitary structure. In the context of the preferred embodiment, objects are database tables or indexes. In other im- 
plementations of the present invention, objects may be other encapsulated computer resources that the end user 
accesses indirectly through validated methods (i.e. , programs) designed specifically to access those computer resourc- 
es. 

DDL Statement: a data definition language statement. DDL statements are used to create and modify database 
25 tablos. ! 

End user: a person using a workstation to access database information in a database server. End users typically 
do not have the authority to modify the structure of database tables. =; 

Operator: a person using a workstation who has the authority and access rights to modify the structure of database 
tables and to manually initiate compilation of SQL source code prograhns. 
30 •; 
Audit Trial, Database Table and Index Data Structures 

( 

Figure 2A shows the data structure of a typical database table 120-1. The table 120-1 includes a file label 160, 

which is essentially a compactly stored copy of the catalog information for the database table, which represents the 
35 table's schema as well as other information not relevant here. Next, the table includes a primary key B-tree 162. The 

use of B-trees in database files is well known to those of ordinary skill in the art of database management systems. 

Next, the table has a data array 1 63 organized in rows and columns. The rows of the table are often called "records" 1 64. 
In the context of preferred embodiments of the present invention/every database table has a primary index. The 

value of the primary index for a particular record is called the primary key, and the primary key is typically equal to 
40 either (A) the value in one field (i.e., column), (B) the concatenation of the values in a plurality of columns, or (C) a 

computed function of the values in one or more columns. The set of ^columns used to generate the primary key are 

represented by a vector herein called the PrimaryKeyColumns vector. There is a B Create_PrimaryKey" function for 

every database table, represented as follows: ; 
^ PrimaryKey = v 

Create_PrimaryKey(BaseTable(RecPtr), F;imaryKeyColumns) 

where RecPtr s a pointer to a database table record. 

It is often the case that an application program needs to access a database table in accordance with a set of 
so column values, at least some of which are not included in the primary index. When that is the case, a Create Index 
procedure can be used to create an efficient alternate access path to" the database table by ordering data according 
to the values in any specified set of columns. That ordering is represented by an "Alternate Index," which is typically 
implemented as a separate data structure from the associated database table. 

Figure 2B shows the data structure of an alternate index 170. The alternate index 170 includes a file label 172, 
ss which includes a compactly stored copy of the catalog information for the index. The altenate index also includes an 
alternate key B-tree 174 and then a data array 176 organized in rows and columns. The data array has two sets of 
columns herein called the AttKey columns and the PrimaryKey columns. 

The rows of the data array 176 are called records 178, and each row of the Alternate Index corresponds to one 
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record of the associated database table. Furthermore, each row of the Aitemate Index has two fields: one represents 
the aitemate key value for the corresponding database table record, and one represents the Primary Key value for the 
same database table record. 

The set of columns used to generate the alternate key for a particular database table are represented by a vector 
s herein called the AltemateKeyColumns vector. Thera is a w Create_AltKey" function for every alternate index of any 
database table, represented as follows: 

AltKey = Create_AllKey(BaseTable(RecPlr), PrimaryKeyColurnns) 

where RecPtr is a pointer to a database table record;. 
io For the purposes of the Move Partition, Split Partition and Move Partition Boundary operations, a database index 

may be viewed in the same way as a database tablel 

Figure 2C shows the data structure of an Audit Trail 138-1. The Audit Trail includes a file label 182 and a set of 

sequentially generated and stored audit entries 184. Each audit entry 184 denotes a database table or index record 

event such as the addition, deletion or alteration of a specified database table or index record in a specified database 
is table or index. 

Database Table Alteration Procedures 

Figure 3 is a conceptual representation of the procedure for modifying a database table or index. The commands 
20 for making database table or index alterations are Celled data definition language (DDL) statements, in the preferred 
embodiment, the DDL statements used are not changed, except for the use of an "ONLINE" option in the DDL state- 
ments to indicate that the DDL operation is to be performed while minimizing the impact of the DDL operation on user 
transactions. When the ONLINE option is specified, the preferred embodiment of the present invention changes how 
the SQL catalog manager 130 executes the Move Partition, Split Partition, Move Partition Boundary and Create Index 
25 commands. 

The procedure for modifying a database table's structure begins with a user or operator entering a DDL statement, 
specifying an alteration in the schema of a specified object 200, which is typically either a database table, an index, 
or a partition of a database table or index. The specified object is accessed to read its file label, and if the command 
requires generating a new object 202, the new object: is created. If the command involves movement of data between t 

30 the first object 200 and a second object 202 that already exists, the file label of the second object is also accessed. 

In the first phase of execution of the "online DDL command," a "dirty copy" of data from the first object 200 into 
the second object 202 is made while user transactions against the first object (and against the second object if it existed 
prior to the command) are allowed to continue unimpeded by the execution of the DDL command. In particular a "dirty 
copy" is made by accessing all the data records of the first object that are the subject of the DDL command, using 

35 "browse" access, and generating corresponding records in the second object 202 until the last of the relevant data 
records in the first object 200 have been accessed. - 

Browse access; is a form of read access that allows database records to be read through any record locks that 
may be in place. Thus, browse access will sometime^ result in the reading of records which are in the midst of being 
modified. Also, user transactions against the first objoct may delete or modify records in the first object while or after 

40 they are accessed by the first phase of the DDL command execution. As a result, the records created in the second 
object 202 may require a certain amount of correction. 

In the second phase of execution of the online DDL command, a partial clean-up of the records in the second 
object is performed by accessing records in the audit trail 204 associated with the first object 200 and performing 
corresponding "redo" operations against the second 'object. During this second phase user transactions against the 

45 first object (and against the second object if it existed prior to the command) are allowed to continue unimpeded by 
the execution of the DDL command. 

More specifically, before the dirty copy in the first phase of the online DDL command is executed, a pointer called 
AudilPtr is set to the end of the audit trail 204. During the second phase, each audit record beginning with the one 
referenced by the AuditPtr is inspected. If the audit record is relevant to operations performed on the first object 200, 

so or relevant to the subset of records of the first object that are the subject of the DDL command, then a redo operation 
is performed against the second object 202 based on the information in the audit record. In the case of a.Create Index 
command, the redo operation uses new audit records that are generated based on the audit entries found in the audit 
trail because tho second object has a very different structure than the first object; in the case of the other three online 
DDL commands, the redo operation uses the audit record as found in the audit trail except that the audit record is 

55 modified prior to the redo operation so as to reference the second object instead of the first object, and the redo 
operation is performed against the second object. 

The second phase continues until the end of the audit trail is reached. 

In the third phase, a lock is requested against the first object and second object (if it existed prior to the DDL 
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command) and when that lock (or those locks) is (are) granted all user transactions other than browse access trans- 
actions are blocked until the third phase is completed. During the third jDhase, any new audit records in the audit trail 
204 associated with the first object 200 are accessed and corresponding "redo" operations are performed against the 
second object 202. Next, if the first object is a portion of a larger database table or index, a lock against the entire 
s associated database table associated with the first object is obtained while catalog and file labels are updated. In 
particular the catalog entries and the file labels associated with the first and second objects are updated to reflect the 
results of the DDL operation. Furthermore, the catalog entries and file labels of all objects which incorporate schema 
information made invalid by the DDL operation are also updated. Then the lock on the first and second objects (and 
the lock, if any, on the associated database table or index) is released, enabling user transactions against the first and 

10 second objects to resume. • u - 

Finally, if the DLL command requires deletion of the first object or deletion of a range of records m the first object, 
that deletion operation is performed in a way that permits concurrent transaction activity. 

It is noted that the implementation of database reconfiguration operations using well established, optimized, da- 
tabase recovery process procedures makes the database reconfiguration operations efficient in terms of computation 
15 time and in terms of the computer resources used. 

Appendix 1 lists a pseudocode representation ol the Move Partition procedure used by the SQL catalog manager 
to move a database table partition from one disk to another. 

Appendix 2 lists a pseudocode representation of the Split Partition procedure used by the SQL executor to split a 
database table partition into two partitions. 
20 Appendix 3 lists a pseudocode representation of the Move Partition Boundary procedure used by the SQL catalog 

manager to move rows (i.e., records) of a database table between adjacent partitions (i.e., partitions with adjacent 
ranges of the table's primary index). 

Appendix 4 lists a pseudocode representation of the Create Index procedure used by the SQL catalog manager 
to create a new "alternate 11 index for a database table. 
25 Appendix 5 lists a pseudocode representation of an alternate embodiment of the Create Index procedure. 

Appendices 6 7 and 8 lists pseudocode representations of procedures used by the Create Index procedure. 
The pseudocode used in Appendices 1 through 8 is, essentially, a computer language using universal computer 
language conventions. While the pseudocode employed here has been invented solely for the purposes of this de- 
scription, it is designed to be easily understandable by any computer programmer skilled in the art. 
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Move Partition Procedure 



The function of the Move Partition procedure is to move a database table or index, or a specified partition of the 
table or index, from a first disk location to a second disk location. The^procedure is executed in response to a Move 

35 Partition command. The second disk location is typically located on a different disk from the first, and often will be 
located on a different computer node from the first. For the purposes of explaining the Move Partition procedure, it will 
be assumed that a partition of a database table is being moved, with the understanding that the same steps would be 
performed for moving an entire unpartitioned database file or for moving an index file or a partition of an index file. 
Referring to Figure 4 and Appendix 1, the steps of the Move Partition procedure are as follows. The database 

40 partition to be moved, herein called the Old Partition, is accessed, and a new file for the new database partition, herein 
called the New Partition, is created on the appropriate disk volume. Furthermore, an audit trail pointer, AuditPtr, is set 
to point to where the next record in the audit trail for transactions against the database table associated with the Old 
Partition will be located. 

In the first phase (220) of the Move Partition procedure, the records in the Old Partition are accessed using Browse 
45 access (i.e., read through locks), and records are copied from the Old Partition to the New Partition. While records are 
created in the New Partition, an up-to-date primary index B-tree is maintained for the New Partition. To prevent further 
repetition, it is noted that for all records created and updated by the online DDL procecures, a corresponding B-tree 
is updated. 

Transactions by end users against the Old Partition are allowed to continue during this phase of the Move Partition 
so procedure, and those transactions continue to insert, delete and update records in the Old Partition and create new 
audit trail entries in the audit trail. 

In the second phase (222) of the Move Partition procedure, transactions by end users against the Old Partition 
are allowed to continue, and those transactions insert, doloto and updato rocords in the Old Partition and croato now 
audit trail entries in the audit trail. i 
55 At the beginning of the second phase, an AuditTrail filter is established such that only Audit Trail records that 

pertain to the Old Partition are received for processing (by an Audit Fixup Process started by the SQL catalog manager). 
The processing of each audit record that passes the filter is called an "audit fixup" . 

Each received audit record that passes the filter is initially processed by modifying the audit record to refer to the 
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NewPartition. Next, the modified audit record is inspected to determine whether the change to the table noted in the 
audit record is already reflected in the copied records, "f so, the audit record is ignored. Otherwise, a "redo" with respect 
to the modified audit record is performed so as to apply the change noted in the modified audit record to the NewPar- 
tition. This process is repeated for each received audit record that passes the filter until the end of the audit trail is 
s reached. 

In the third phase (224) of the Move Partition procedure, a transaction request is made for a file lock on the Old 
Partition. This is a transaction request that goes in the lock queue. User transactions initiated before the third phase 
are not affected, but the lock request prevents users from committing transactions initiated after the lock request is 
made. When the requested lock is granted the remainder of the third phase is performed as a unitary transaction. The 
10 first part of the third phase transaction is performing the audit fixup process described above for all audit records 
referencing the Old Partition (i.e., that pass the filter) that were created after completion of the second phase. 

When processing of the last such audit record in the audit trail is completed, the third phase transaction requests 
a lock on the entire database table associated with the Old Partition. This prevents all user transactions on the entire 
database table until the lock is released. When the full table lock is granted, the catalog entry for the Old Partition is 
is - deleted and a new catalog entry for the New Partition is created. The file labels and catalog entries for all partitions of 
the database table are also updated so as to reference the New Partition. 

When all catalog and file label updates have been completed, the lock on the database table is released, enabling 
user transactions against the database table to resume. Finally, the Old Partition is deleted, the disk space used by 
the Old Partition file is released, and then the third phase transaction is concluded. 

20 « 
Split Partition Procedure 

The function of the Split Partition procedure is. to move a portion of a database table, or a portion of specified 
partition of a database table, from a first disk location to a second disk location. The procedure is executed in response 
25 to a Split Partition command. The second disk location is typically located on a different disk from the first, and often 
will be located on a different computer node from the first. For the purposes of explaining the Split Partition procedure, 
it will be assumed that a partition of a database file is being split into two partitions, with the understanding that the 
same steps would be performed for splitting a previously unpartitioned database file or for splitting an index file or a 
partition of an index file. 

30 Referring to Figure 5 and Appendix 2, the steps of the Split Partition procedure are as follows. The database 

partition to be split, herein called the Old Partition, is accessed, and a new file for the new database partition, herein 
called the New Partition, is created on the appropriate disk volume. Furthermore, an audit trail pointer, AuditPtr, is set 
to point to where the next record in the audit trail for transactions against the database table associated with the Old 
Partition will be located. 

35 The primary key ranges associated with the Old Partition and New Partition are as follows. The Old Partition's 

initial primary key range is called OldRange, and its new, reduced primary key range is called NewRange 1 . The primary 
key range for the New Partition is called NewRange2. 

In the first phase (230) of the Split Partition procedure, the records in the Old Partition having a primary key in 
NewRange2 are accessed using Browse access (i.e., read through locks), and are copied from the Old Partition to the 

40 New Partition. 

Transactions by end users against the Old Partition are allowed to continue during this phase of the Split Partition 
procedure, and those transactions insert, delete and update records in the Old Partition and create new audit trail 
entries in the audit trail. 

In the second phase (232) of tha Split Partition procedure, transactions by end users against the Old Partition are 
45 allowed to continue, and those transactions insert, delete and update records in the Old Partition and create new audit 
trail entries in the audit trail. 

At the beginning of the second phase, an AuditTrail filter is established such that only Audit Trail records that 
pertain to the Old Partition witn a primary key value in NewRange2 are received for processing. 

Each remaining received audit record is initially processed by modifying the audit record to refer to the NewPartition. 
so Next, the modified audit record is inspected to determine whether the change to the table noted in the audit record is 
already reflected in the copied records. If so, the audit record is ignored. Otherwise, a "redo" with respect to the modified 
audit record is performed so as to apply the change noied in the modified audit record to the NewPartition. This process 
is repeated for each received audit record that passes the filter until the end of the audit trail is reached. 

In the third phase (234) of the Split Partition procedure, a transaction request is made for a lock on the Old Partition. 
55 Jhis is a transaction request that goes in the lock queue. User transactions initiated before the third phase are not 
affected, but the lock request prevents users from committing transactions initiated after the lock request is made. 
When the requested lock is granted the remainder of the third phase is performed as a unitary transaction. The first 
part of the third phase transaction is performing the audit fixup process described above for all audit records referencing 
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the Old Partition that were created after completion of the second phase and for which either the old record image or 
new record image has a primary key value in NewRange2 (i.e., that pass the fitter). 

When processing of the last such audit record in the audit trail is completed, the B-tree for Old Partition is split into 
two B-trees, or e for records having a primary key in NewRangel and the other for records having a primary key in 
s NewRange2. 

Then the third phase transaction requests a lock on the entire database table associated with the Old Partition. 
This permits existing user transactions that work on the table to complete, but suspends all new user transactions on 
the entire database'-table until the lock is released. When the full tablfe lock is granted, the catalog entry for the Old 
Partition is deleted and a catalog entry for the New Partition is created. The file labels and catalog entries for all partitions 
10 of the database table are also updated so as to reference the New Partition. 

When all catalog and file label updates have been completed, the lock on the database table is released, enabling 
user transactions against the database table to resume. Finally, an access check is put in place to make application 
transactions unable to access records in the NewRange2 B-tree of the OldPartition, and then the third phase transact; 
on is concluded. 

is All records in the NewRange2 part of the Old Partition are deleted without interfering with concurrent application 

transactions against the table, and then the access check for the OldPartition is removed. 

Move Partition Boundary Procedure 

20 The function of the Move Partition Boundary procedure is to move records in a specified range of, primary key 

values from a first specified partition of a database table to a second adjacent partition of that database table. The 
second partition is typically located on a different disk from the first, and often will be located on a different computer 
node from the first. The procedure is executed in response to a Move Partition Boundary command. For the purposes 
of explaining the Move Partition Boundary procedure, it will be assumed that records are being moved between adjacent 

25 database file partitions, with tho understanding that tho same stops would bo porformod for moving records botwoon 
adjacent index file partitions. 

Referring to Figure 6 and Appendix 3, the steps of the Move Partition Boundary procedure are as follows. The 
database partition from which records: are to be moved is herein called Partitionl and- the database partition to which 
those records are to be moved is called Partition2. An audit trail pointer, AuditPtr, is set to point to where the next 

30 record in the audit trail for transactions against the database table will be located. 

The primary key ranges associated with Partitionl and Partition 2 are as follows. Partition's initial primary key 
range is called OldRangel , and its new. reduced primary key range is called NewRangel . The initial primary key range 
for the Partition2 is called OldRange2 and its new expanded key ranga is called NewRange2. The primary key range 
of the records to be moved, called MoveRange, is equal to the range of primary key values in NewRange2 that are 

35 not in OldRange2. 

In the first phase (240) of the Move Partition Boundary procedure, the records in the Old Partition having a primary 
key in the MoveRange are accessed using Browse access (i.e., read through locks), and are copied from Partitionl 
to Partition2. 

Transactions by end users against Partitionl and Partition2 are allowed to continue during this phase of the Move 
40 Partition Boundary procedure, and those transactions insert, delete and update records in Partitionl and Partition2 
and create new audit trail entries in the audit trail. 

In the second phase (242) of the Move Partition Boundary procedure, transactions by end users against Partitionl 
and Partition2 are allowed to continue, and those transactions insert, delete and update records in Partitionl and 
Partition2 and create additional new audit trail entries in the audit trail. 
45 At the beginning of the second phase, an AuditTrail filter is established such that only Audit Trail records that 

pertain to the Partitionl are received for processing by the SQL catalog manager. Furthermore, those audit records for 
which neither the old record image (if any) nor the new record image (if any) have a primary key value in the MoveRange 
are also ignored. ^ 

Each remaining received audit record is initially processed by modifying the audit record to refer to the Partition2. 
so Next, the modified audit record is inspected to determine whether the' change to the table noted in the audit record is 
already reflected in the copied records. If so, the audit record is ignored, Otherwise, a "redo" with respect to the modified 
audit record is performed so as to apply the change noted in the modified audit record to Partition2. This process is 
repeated for each received audit record that passes the filter until the end of the audit trail is reached. . 

In the third phase (244) of the Move Partition Boundary procedure, a transaction request is made for a lock on the 
S5 Old Partition. This is a transaction request that goes in the lock queue. User transactions initiated before the third phase 
are not affected, but the lock request prevents users from committing transactions initiated after the lock request is 
made. When the requested lock is granted the remainder of the third phase is performed as a unitary transaction. The 
first part of the third phase transaction is performing the audit fixup process described above for all audit records 
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referencing Partition! that were created after completion of the second phase and for which either the old record image 
or new record image has a primary key value in the MoveRange (i.e., that pass the filter). 

When processing of the last such audit record in the audit trail is completed, the B-tree for Partition 1 is split into 
two B-trees, one for records having a primary key in NewRangel and the other for records having a primary key in the 
5 MoveRange. 

Then the third phase transaction requests a lock on the entire database table associated with Partitionl and 
Partition2. This permits existing user transactions thai work on the table to complete, but suspends all new user trans- 
actions on the entire database table until the lock is released. When the full table lock is granted, the catalog entries 
and file labels for Partitionl and Partition2 are updateci to reflect their new primary key ranges. Furthermore, the catalog 
10 entries for all partitions of the database table are alt'o updated so as to reference the key ranges ot Partitionl and 
Partition2. 

When all catalog and file label updates have beeh completed, the lock on the database table is released, enabling 
user transactions against the database table to be initiated and executed. Finally, an access check is put in place to 
make application transactions unable to access records in the MoveRange B-tree of the OldPartition, and then the 
15 third phase transaction is concluded. 

All records in the MoveRange part of the Old Partition are deleted without interfering with concurrent application 
transactions against the table, and then the access check for the OldPartition is removed. 

Create Index Procedure 

20 

The function of the Create Index procedure is to create an alternate index for a database table, where the alternate 
index is generally a different function of the database table columns than the primary index for the database table. 

Referring to Figure 6 and Appendices 4, 5, 6 and 7, the steps of the Create Index procedure are as follows. The 
database object from which records are to be indexe d is herein called the Base Table and the alternate index being 
2$ created is called the Nowindox. The sot of columns usod to generate- the alternate key for tho Nowlndex are represented 
by a vector called the AltemateKeyColumns vector. Furthermore, a flag called UniquelndexFlag is set to True if the 
Newlndex is to be a unique index, which means that every record in the Base Table must have a unique alternate index 
value, and is set to False if multiple records in the Base Table are allowed to have the same alternate index value. 

An audit trail pointer, AuditPtr, is set to point to where the next record in the audit trail for transactions against the 
30 database table will be located. 

In the first phase (250) of the Create Index procedure, the records in the Base Table are accessed using Browse 
access (i.e., read through locks), and for each such record a "create new index record" procedure 251 is performed. 
The "create new index record" procedure 251 , which is also used in the later phases of the Create Index procedure, 
begins by computing primary and alternate index values for a particular record using the index generation functions: 

35 

PrimaryKeyl = Create_Prinr!ary Key (Record, PrimaryKeyColumns) 

AftKeyl = Create_Alt Kay (Record, PrimaryKeyColumns) 

If the UniquelndexFlag is False, an index record with.AltKeyl and PrimaryKeyl as its two fields is stored in the New- 
40 Index. 

If the UniquelndexFlag is set to True, prior to storing an index record with AitKeyl and PrimaryKeyl as its two 
fields in the Newlndex, the Newlndex is searched to soe if Newlndex already has a record with an alternate index value 
of AitKeyl . If such a record is found in the Newlndex, a potential duplicate index problem exists. To resolve whether 
a duplicate index problem has been encountered; two tests are perfonned. First, a repeatable read transaction is 
45 performed on the Base Table to see if the Base Table still stores a record with primary and alternate key values of 
PrimaryKeyl and AitKeyl. If such a record is found in the Base Table, the second test is performed. 

The existing Newlndex record with an alternate key value of AitKeyl is read (with a repeatable read operation) to 
determine the primary key value, PrimaryKey2, stored with it. If this Newlndex record still exists, a second repeatable 
read transaction is performed on the Base Table to see if the Base Table stores a record with a primary key value of 
so PrimaryKey2 and an alternate key value of AitKeyl. If both tests produce positive results, the Create Index procedure 
is aborted because two database records with identical alternate key values have been encountered. Otherwise, if 
either test returns a negative result, an index record with AitKeyl and PrimaryKeyl as its two fields is stored in the 
Newlndex, and the index record with AitKeyl and Pri,maryKey2 (if any) is deleted from the Newlndex. 

In the second phase (252) of the Create Index procedure, transactions by end users against the Base Table are 
55 allowed to cortinue, and those transactions insert, delete and update records in the Base Table and create additional 
new audit trail entries in the audit trail. 

At the beginning of the second phase, an Audit Trail filter is established such that only Audit Trail records that 
pertain to the Base Table are received for processir g by the SQL catalog manager. Each received audit record is 



EP 0 723 238 A1 



processed by an "audit fixup for create index" procedure 253 specially .designed for use as part of the Create Index 
procedure. 

The "audit fixup for create index" procedure 253 progresses an audit record as follows. If the audit record denotes 
a change to a Base Table record that does not alter either the primary key or the alternate key of the record, or the 

5 change is already reflected in the Newlndex, the audit record is ignored. 

If the audit reword denotes deletion of a Base Table record, an audit 'record representing deletion of the associated 
Newlndex record is constructed and a "redo" of the newly created audit record against the Newlndex is performed. As 
a result, the associated Newlndex record is deleted, if one exists. 

If the audit record denotes addition of a Base Table record, an audit record representing insertion of a corresponding 

w Newlndex record is constructed and a "redo" of the newly created audit record against the Newlndex is performed. If 
a unique alternate index is being created and a duplicate Newlndex record with the same primary and alternate key 
values is found during this procedure, the index creation procedure is aborted. Otherwise, the aforementioned steps 
result in a new index record being stored in the Newlndex for the inserted Base Table record. 

Finally, if the audit record indicates that either the primary index or alternate index of a record have been altered, 

is then a first audit record representing deletion of the Newlndex record associated with the record's old value is con- 
structed, and a second audit record representing insertion of a new Newlndex record corresponding to the record's 
new value is constructed. Redo's of both constructed audit records are then performed against the Newlndex. If a 
unique alternate index is being created and a duplicate Newlndex record with the same primary and alternate key 
values is found during this procedure, the index creation procedure is aborted. Otherwise, the aforementioned steps 

20 result in a new index recoded being stored in Ihe Newlndex for the altered Base Table record, and deletion of the prior 
Newlndex record for that Base Table record. 

In the third phase (254) of the Create Index procedure, a transaction request is made for a lock on the Base Table. 
This is a transaction request that goes in the lock queue. User transactions initiated before the third phase are not 
affected, but the lock request prevents users from committing transactions initiated after the lock request is made. 

25 When the roquosted lock is granted the romaindor of the third phase is performed as a unitary transaction. The first 
part of the third phase transaction is performing the "audit fixup for create index" process 253 described above for all 
audit records referencing the Base Table that were created after completion of the second phase (i.e., that pass the 
filter). 

When processing of the last such audit record in the audit trail is completed, a catalog entry for the Newlndex is 
30 created, and file labels for all partitions of the Base Table are updated to reflect the existence of the Newlndex. Then 
the lock on the Base Table is released, the third phase transaction is concluded, and the Create Index procedure is 
completed. 

In an alternate embodiment, represented in Appendix 5, the first phase of the Create Index procedure utilizes an 
indexing procedure that processes the entire Base Table, even if records that potentially have duplicate alternate key 
35 values are encountered. All such potential duplicate records are copied to a scratch pad memory area, and then are 
processed by the "create new index record" procedure 251 to determine if records with duplicate alternate key values 
in fact exist, and to add additional records to the Newlndex if no duplicate alternate key values are found. 

ALTERNATE EMBODIMENTS 

40 I 

While the present invention has been described with reference to a few specific embodiments, the description is 
illustrative of the invention and is not to be construed as limiting the invention. Various modifications may occur to those 
skilled in the art without departing from the true spirit and scope of the invention as defined by the appended claims. 

45 
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APPENDIX 1 

Pseudocode Representation of Move Partit on Procedure 

Procedure MovePartition (OldPartition, NewPartition) 
1* OldPartition identifies the partition to be moved 

NewPartition identifies the new partition to be created 

BaseTable Is the database table associated with the OldPartition 7 

{ 

/* Phase 1: Dirty Copy 7 

Access current database partition (OldPartition) to be moved 
Create file for new database partition (NewPartition) 
Create file label for the NewPartition 
AuditPtr = End of Audit Trail 

r AuditPtr initially points to where the next record in the Audit Trail will 
be located, when it is generated 7 

While accessing records in the OldPartition, using Browse access (i.e., read 
through locks), copy records from the OldPartition to the NewPartition 

Maintain an up-to-date primary index B-tree for each record copied into the 
NewPartition 

All transactions by end users continue to insert, delete and update records in 
the OldPartition and to create new audit trail entties 

r Phase 2: Partial Cleanup 7 

Establish AuditTrail filter: Access only Audit Trail records that pertain to the 

OldPartition 
Do Until End of Audit Trail is reached 

{ 

r Process entry in Audit Trail at AuditPtr */ 

Modify the Audit Record to refer to the NewPartition 

If the change to the table noted in the audit record is already reflected in 

the copied records 

{ Ignore Audit Record } 
Else 

{ 

Perform a "Redo" with respect to the modified Audit Record so as to 
apply the change noted in the Audit Record to the NewPartition 
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Updatu B-tree for the NewPartition for eaqh i.pdate to the 
NewPartition 

Update AuditPtr to Next Audit Record (skip recor ds that do not pertain to 

the OldPartition) 
} r End Do Loop */ 

10 

r Phase 3: Final Cleanup V 

Request and Wait for Lock on the OldPartition 

r This is a transaction request that goes in the loci; queue. This does not 
affect transactions initiated before Phase 3, but prevents users from 
committing transactions initiated after the Lock Request is made. 7 

r When Lock is granted ... •/ 
20 Do As A Unitary Transaction 

{ 

LastPtr = Last Record in Audit Trail 

r Previously established Audit Trail Filter cpntl: lued to filter out audit 
25 records not applicable to the OldPartition 7 

Do Until E nd of AuditPtr reaches LastPtr. 

{ 

30 Modify the Audit Record to refer to the NewPartition 

Perform a 'Redo' with respect to the modified Audit Record so as to 
apply the change noted in the Audit Record to the NewPartition 
Update B-tree for the NewPartition for each update to the 

35 NewPartition 

Update AuditPtr to Next Audit Record (skip racords that do not 
40 pertain to the OldPartition) 

} r End Do Loop */ 

Request Lock on the entire Base Table t 
45 When Lock is granted: 

Deleto Catalog entry for the OldPartition 
Create Catalog entry for the NewPartition 
so Update file labels and catalog entries for all partitions of BaseTable 

so as to reference the NewPartition 

Release Lock on BaseTable, enabling user transactions against the 
55 Base Table to resume execution 
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Delete the OldPartition and release disk space used by the OldPartition 
file 

} T End ot transaction */ 



Return 
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APPENDIX 2 

Psejdocode Representation of Split Partition Procedure 

Procedure SpttPartition (BaseTable, OldPartition, OldRange, NewRangel, 
NewRang«2, NewPartition) 

7* OldPartition identifies the partition to be split 

NewPartition identifies the new partition to be created 

OldRange is the range of Primary Key values currently assigned to 

the OldPartition 

NewRangel is the new range of Primary Key values assigned to the 
OldPartition 

NewRange2 is the range of Primary Key vak es assigned to the 
NewPartition to be created 

BaseTable is the database table associated with the OldPartition 

*/ 

Access current database partition (OldPartition) to be split 
Create file for new database partition (NewPartition) 
Create file label for the NewPartition 
AuditPtr = End of Audit Trail 

r AuditPtr initially points to where the next record in the Audit Trail will 
be located, when H is generated •/ 

f Phase 1: Dirty Copy*/ 

While accessing records in the OldPartition, using Browse access (i.e., read 
through locks), copy records having a Primary Key value in NewRange2 
from the OldPartition to the NewPartition •" 

Maintain an up-to-date primary index B-tree for each record copied into the 
NewPartition 

All transactions by end users continue to insert, delete and update records in 
the OldPartition and to create new audit traif ent ies 

r Phase 2: Partial Cleanup V 

Establish AucitTrail filter: Access only Audit Trail records that pertain to the 

OldPartition with a Primary Key value in NewRange2 
Do Until End of Audit Trail is reached 

{ 
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F Process entry in Audit Trail at AuditPtr 7 

Modify the Audit Record to refer to Partition2 

If the change to the table noted in the audit record is not already 

reflected in the copied records 

{ 

Perform a "Redo* with respect to the modified Audit Record so as to 

apply the change noted in the Audit Record to Partition2 
Update B-tree for Partition2 for each update to Partition2 
} 

Update AuditPtr to Next Audit Record (skip records that do not pertain to 

Partitionl or with a Primary Key value not in NewRange2) 
} r End Do Loop 7 

I* Phase 3: Final Cleanup 7 
Request Lock on the Old Partition 

P This is a transaction request that goes in the lock queue. This does not 
affect transactions initiated before Phase 3 ( but prevents users from 
committing transactions initiated after the Lock Request is made, 7 

t* When Lock is granted ... 7 

Do as a Unitary Transaction: 
{ 

LastPtr = Last Record in Audit Trail 

r Previously established Audit Trail Filter continued to filter out audit 

records not applicable to NewRange2 in the OldPartition 7 
Do Until End of AuditPtr reaches LastPtr 

{ 

Modify the Audit Record to refer to the NewPartition 

Perfoim a "Redo" with respect to the modified Audit Record so as to 

apply the change noted in the Audit Record to the NewPartition 
Update B-tree for the NewPartition for each update to the 

NewPartition 

Update AuditPtr to Next Audit Record (skip records that do not 
pertain to NewRange2 in the OldPartition) 

} 

} r End Do Loop 7 
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Split the E;-tree for Partitionl into two B-trees, ona for records in 

NewRangel (those remaining in Partition 1 !) and one for the records 
in NewRange2 to be removed from Partitionl (the Prune B-tree). 

Request L ock on the entire Base Table 
When Lock is granted: 

Update Catalog entry for the OldPartition to reflect new index 

boundaries for the OldPartition 
Update label for the OldPartition, including putting in place an 

access check that prevents user transaction from accessing 

NewRange2 
Create Catalog entry for the NewPartitipn 
Update file labels and catalog entries for" all partitions of BaseTable 

to reference the NewPartition 

Release Lock on BaseTable, enabling user transactions against the 
Base Table to resume execution 

} / * end of unitary transaction for third phase / 

/* Phase 4: Background garbage collection 7 
In background mode: 

Delete all records from the OldPartition that; are leferenced by the Prune 
B-tree (this releases unused space from the OldPartition file) 

Remove access check in the OldPartition 
Return 
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APPENDIX 3 

Pseudocode Representation of Move Partition Eoundary Procedure 

Procedure MovePartitionBoundary (Partition!, OldRengel, NewRanget, 
Partition2, OldRange2, NewRange2) 

r BaseTable is database table associated with Partitionl and 
Partition2 

Partitionl and Partition2 identify two partitions having adjacent 
assigned Primary Key Ranges 

OldRangel is the range of Primary Key values currently assigned to 
Partitionl 

OldRange2 is the range of Primary Key valuas currently assigned to 
Partition2 

NewRangel is the new range of Primary Ke/ values assigned to 
Partitionl 

NewRange2 is the new range of Primary Ke/ values assigned to 
Partition2 

7 

Access database Partitionl and Partition2 
AuditPtr = End of Audit Trail 

T AuditPtr initially points to where the next record in the Audit Trail will 
be located, when it is generated 7 
Generate MoveRange = range of Primary Key values for records in 

NewRancie2 that are not in OldRange2 

f Phase 1: Dirty Copy 7 

While accessing records in Partition! using Browse access, copy records 

having a Primary Key value in tyoveRange from Partitionl to Partition2 
Update primary index B-tree for Partition2 for each record copied into 

Partition2. Leave B-tree for Partitionl unchange d. 
All transactions by end users continue (A) to Insert, delete and update 

records in Partitionl and Partition2 using old paction range boundaries 

and (B) to create new audit trail entries 
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/* Phase 2: Partial Cleanup 7 

Establish AuditTrait filter Access only Audit Trail records that Rertain to 

Partitionl with Primary Key values In MoveRange 
Do Until End of Audit Trail is reached 

{ 

T Process entry in Audit Trail at AuditPtr 7 
Modify the Audit Record to refer to Partition^ 
If the change to the table noted in the audit record is not already 
reflected in the copied records 

Perform a "Redo" with respect to the modified Audit Record so as to 

apply the change noted in the Audit Reco rd to Partition2 
Updato B-tree for Partition2 for each update 10 Partition2 

} 

Update AuditPtr to Next Audit Record (skip records that do not pertain to 

Partitionl or with a Primary Key value not in MoveRange) 
} r End Do Loop 7 

f Phase 3: Final Cleanup 7 

Request Lock on Partitionl and Partition2 

r This is a transaction request that goes in the loci queue. This does not 
affect transactions initiated before Phase 3, but prevents users from 
committing transactions initiated after the Lock Request is made. 7 

r When Lock is granted ... V 

Do as a Unitary Transaction: 
{ 

LastPtr = Last Record in Audit Trail 

T Previously established Audit Trail Fitter continued to filter out audit 

records not applicable to MoveRange in Partitionl 7 
Do Until End of AuditPtr reaches LastPtr 

{ r 

Modify the Audit Record to refer to Partition2 

Perform a "Redo" with respect to the modified Audit Record so as to 
apply the change noted in the Audit Record to Partition2 
Update B-tree for Partition2 for each update to Partition2 

Update AuditPtr to Next Audit Record (skip records that do not 
pertain to MoveRange in Partitionl) 

} r End Do Loop 7 
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Split the B tree for Partition 1 into two B-trees, one for records in 

NewRangel (those remaining in Partition!) and one for the records 
in the MoveRange to be removed from Partitionl (the Prune B-tree). 

Request Lock on entire Base Table 
When Lock Is granted: 

Update Catalog entry for Partitionl to reflect new partition key 

boundaries for Partitionl 
Update label for Partitionl, including putting in place an access 
check that prevents user transactions from accessing 
MoveRange 

Updato Catalog entry for Partition2 to reflect i iew partition key 

bojndaries for Partition2 
Update label for Partition2 

Updato file labels and catalog entries for all partitions of BaseTable 
to reflect the new parti lion key boundaries for Partitionl and 
Partition2 

Release Locks on BaseTable (including Partitionl and Partition2), 
enabling user transactions against the Base Table to resume 
execution 

} / * end of unitary transaction for third phase * 

I* Phase 4: Background garbage collection 7 
In background mode: 

Delete all records from Partitionl that are referenced by the Prune 
B-tree (this releases unused space from Partitionl file). 

Remove access check in Partitionl 
Return 
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APPENDIX 4 

Pseudocode Representation of Create Indox Procedure 

Createlndex Procedure (BaseTable, AltlndexColumni , UniquelndexFlag, 
Newlndex} 

r BaseTable Is table for which index is to be created. 
' AltlndoxColumns is a vector listing the table columns that define the 
key for the New Index. 

UniquelndexFlag is equal to True if the new index must be a unique 
index (i.e., an index for which every baso table record has a unique 
alternate key value). 

Newlndex is the new index to be created 
PrimarylndexColumns is a vector listing the table columns that 
define the key for the Primary Index of the Base Table 

*/ 

r Structure of Newlndex (and every other index, other than the primary 
index) is: 

File Label - embedded copy of catalog information for index 
B-tree - the B-tree for the index 
Index Records, where each index record has two fields: 
AflKey, PrimaryKey 

where the AltKey field stores the value o the AltlndexColumns 
for one record of the BaseTable, and the PrimaryKey field 
stores the value of the primary key (i.e., the 
PrimarylndexColumns) for that same record 

V . 

Create File for Newlndex, including root node for the new index's B-tree 
Create file label for Newlndex 
AuditPtr = End of Audit Trail 

r AuditPtr initially points to where the next record in the Audit Trail will 
be located, when it is generated */ 
RecPtr = Firsl Record of BaseTable 
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r Phase 1: Dirty Copy V 
Do Until End of BaseTable is Reached 
{ 

Access, using Browse access, BaseTable(RecPt ) 
AltKeyl = Create_AttKey( BasoTable(RecPtr), AltlndexColumns) 
PrimaryKGyl = Create_PrimaryKey( BaseTable(F tecPtr), 
PrimarylndexColumns ) 

r Check lor Duplicate Index Values only if UniqielndexFlag is True*/ 
If UniquelndexFlag 

{ 

Search Newlndex for an index record with 

Newlndex.AltKey equal to AltKeyl 
If a match is found 

{ 

Call CheckDupRecord (BaseTable, Newlndex, 

BaseTable(RecPtrl) 
If ReturnCode s "fair 

Abort Createlndex Procedure 

Else 

Call CreatelndexRecord (Newlndex, AltKeyl, PrimaryKeyl) 

} 

Else / * UniquelndexFlag is False 7 

Call CreatelndexRecord (Newlndex, AltKeyl, PrimaryKeyl) 

Incremen: RecPtr to point to nsxt BaseTable rec ord, if any 
} r end of Do loop 7 

r Phase 2: Partial Cleanup V 

Establish AuciitTrail filter Access only Audit Trail records that pertain to the 

Base Table 
Do Until End of Audit Trail is reached 

{ 

Call AuditRecord_to_Newlndex (AuditPtr, BaseTable, Newlndex, 

IndexPtr) for the current Audit Record 
Update AuditPtr to Next Audit Record (skip records that do not pertain to 

the Base Table) 

} 
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r Phase 3: Final Cleanup 7 
Request Lock on the Base Table 

r This is a transaction request that goes in the loci, queue. This does not 
affect transactions initiated before Phase 3, but prevents users from 
committing transactions initiated after the Lock Request is made. 7 
r When Lock is granted ... 7 
Do as a Unitary Transaction: 

{ - 

LastPtr = l_ast Record in Audit Trail 

/* Previously established Audit Trail Filter contii iued to filter out audit 

records not applicable to the Base Tablo */ 
Do Until End of AuditPtr = LastPtr 
{ 

Call AuditRecordjo.Newlndex (AuditPtr, Ba ;eTable, Newlndex) for 

tha current Audit Record 
Update AuditPtr to Next Audit Record (skip records that do not 

pertain to the Base Table) 
} r End Do Loop 7 

30 Create Catalog entry for Newlndex 

Update filo labels and catalog entries for all parti ions of BaseTable to 
reference the Newlndex 



20 



25 



35 



40 



Release Lock on BaseTable, enabling user transactions against the 

Base Table to resume execution 
} / * end of unitary transaction for third phase / 



Return 
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APPENDIX 5 

Pseudocode for Second Preferred Embodiment 
of Phase 1 of Create Index Proce dure 

r Phase 1 : Dirty Copy 7 
Initialize Scratchpad pointer SP_Ptr 
Do Until End of BaseTable is Reached 
{ 

Access, using Browse access^ BaseTable(RecPt ) 
AltKeyl = Create_AltKey( BaseTable(RecPtr), AMndexColumns) 
PrimaryKeyl = Create_PrimaryKey( BaseTable(RecPtr), 
PrimarylndexColurnns ) 

r Check f or Duplicate Index Values only if Uniquelndex is True*/ 
If UniquelndexFlag 

{ 

Search Newlndex for an index record with 

Nowlndex.AltKey equal to AltKeyl 
If a match is found 

{ 

Store RecPtr in ScratchPad(SP_Ptr) 

Increment SP_Ptr 

} 

Else 

Ccill CreatelndexRecoid (Newlndex, AltK.syl, PrimaryKeyl) 

} 

Else / * UniquelndexFlag is False V 

Call CreatelndexRecord (Newlndex, AltKeyl, PrimaryKeyl) 

Increment RecPtr to point to next BaseTable record, if any 
} I* end of Do loop 7 

r Process Records in Scratchpad V 
Initialize Scratchpad pointer SP_Rr 
Do Until End of Scratchpad is reached 
{ 

RecPtr s JJcratchPad(SP_Ptr) 

Call ChecKDupRecord (BaseTable, Newlndex, BrseTable(RecPtr)) 
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If RetumCode = "fair 

Abort Createlndex Procedure 
Increment SP_Ptr to point to next Scratchpad entry, if any 

} 

f end of Phase 1 */ 

APPENDIX 6 

Pseudocode Representation of Create Index Pecord Procedure 

Procedure CreatelndexRecord (Newlndex, NewAltKey, NewPrimaryKey) 

Newlndex Is the new index in which a new record is to be created. 
NewAltKey is the alternate key of a record in the BaseTable for 

which an index record is to be created. 
NewPrimaryKey is the primary key of a jecord in the BaseTable for 
which an index record is to be created. 



Create NewlndexRecord(AltKey,PrimaryKey): 

NewlndexRecord.AltKey = NewAltKey ; 

NewlndexRecord.PrimaryKey = NewPrimaryKey 
If NewlndexRecord is already present in Newlndex 

Return 
Else 

{ 

Store NewlndexRecord in Newlndex 
Add B tree entry for the NewlndexRecord 

) 

Return 



r 
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APPENDIX 7 

i 

Pseudocode Representation of Check Duplicate Record Procedure 

Procedure Ch»3ckDupRecord (BaseTable, Newlndex, DupRecord) 

r BaseTable is a database table 

Newlndex is an index file being created for the BaseTable 
DupRecord is an index record whose AltKey value is the same as 
th>3 AltKey value of an already existing index record 

V 

r Generate alternate key value and primary key value from DupRecord 7 
AltKeyValue2 =* Create_AltKey( DupRecord, AltlndexColumns) 
PrimaryKeyValue2 = Create.PrimaryKey(DupRecord PrimarylndexColumns) 

As a unitary transaction: 
{ 

Access tho base table record, if any, having a primary key value of 

PrimaryKeyValue2, requesting a lock on the requested record 
If Record=tBaseTable(PrimaryKeyValue2) is foun<i in base table 

{ 

r Extract the alternate key value for that base record 7 
AltKeyValueY = Record(PrimaryKeyValue2). AltKey 
If AHKeyValueY * AltKeyValue2 

TransactionReturn ("DupRecord not found") 

} 

Else 

Trans;actionRetum ('DupRecord not found") 

} 

If TransactionReturn = "DupRecord not found" 

r A Base Table record for DupRecord was not found, indicating that 
an incex record for DupRecord is not needed. 7 

{ 

Return (S jccess) 
} 

P A base taole record matching DupRecord's Prim ary and Alternate keys 
was found in base table. 
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25 



First step: Read the already existing Newlndex record whose alternate 
key value is DupRecord.AltKey. 

Next step: Check to see if a record with the primary key value found in 
this Newlndex record is still in Base Table , 

7 

PrimaryKeyValuel = Newlndex(AltKeyValue2).PnmaryKey 
As a unitary transaction: 

{ . 

Access tho base table record, if any. having a primary key value of 
75 PrimaryKeyValuel, requesting a lock on the requested record 

If record is found in base table 
{ 

/* Extract the alternate key value for that bas-3 record '/ 
AltKeyValueX = BaseTable(PrimaryKeyValuei).AItKey 
If AltKoyValiieX * AltKeyValue2 

TnansactionReturn ('conflicting record no: found") 

} 

Else 

TranssictionRetum ("conflicting record not found") 

30 } 

If TransactionRetum » "conflicting record not found" 
{ 

I* No Duplicate Record Problem because original record has been 

deleted or altered */ 
Delete Newlndex(AltKeyValue2) ' 
40 Call CreatelndexRecord (Newlndex, AltKey2, PrirnaryKey2) 

Return (Success) 
) 

Else 
{ 

/* a conflicting base record was found 7 
Return (Fail, "Duplicate key value found") 
} 

r End of CheckDupRecord Procedure 7 

ss 
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APPENDIX 8 

Pseudocode Representation of 'Audit Record to Newlndex" Procedure 

Procedure AuditRecord__to_New!ndex (AuditPtr, BaseTable, Newlndex) 
r BaseTable is database table 
Newlndex is index file 
AuditPtr is a pointer to an Audit Trail record 

V 

r Ignore irrelevant Audit Record V 

If Audit Record denotes a change to a base table record that does not 
alter either the PrimaryKey or AltKey of the rocord 
{ Ignore Audit Record } 

r Record deleted from Base Table 7 

Elself Audit Record denotes deletion of a base U ble record 
{ 

Creat€i an audit record representing deletion of the associated 
Newlndex record 

If this record exists in Newlndex (i.e., matching both the PrimaryKey 
and AltKey of the deleted base table record) 

Rodo the newly created audit record against Newlndex 

} 

r New Record Added to Base Table V 

Elself Audit Record denotes addition of a record to the base table 

{ 

Create; an audit record representing insertion of the associated 

Nowlndex record 
If UniquelndexFlag 

{ 

AltKeyl = Create_AltKey( Inserted Recoid, AltlndexCoIumns) 
Search Newlndex for an index record with 

Newlndex.AltKey equal to AltKeyl 
If a match is found 

{ 

Call CheckDupRecord (BaseTable, I ewlndex, Inserted 
Record) 
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If RetumCode = "fail" 

Abort Createlndex Procedure 

} 

Eli.e 

Redo the newly created audit record igainst Newlndex 

} 

Else / * UniquelndexFlag is False */ 

Redo the newly created audit record against Newlndex 

} 

r Primary Key and/or AltKey in Base Table record was altered V 
Elself audit record denotes alteration of a base trble record 
{ 

Create a first audit record representing deletion of the Newlndex 
record associated with the base table record's old value 

If this record exists in Newlndex (i.e.. matching both the PrimaryKey 

and AltKey of the base table record's old valte) 

Redo the first newly created audit record against Newlndex 

Create a second audit record representing insertion of the Newlndex 
record associated with the base table record's new (altered) 
value : 

If UnicuelndexFlag 

{ 

AHKeyt « Create_AltKey( Altered Record, AltlndexColumns) 
Search Newlndex for an index record with 

Newlndex.AltKey equal to AltKey! 
If a match is found 

{ 

Call CheckDupRecord (BaseTabie, Newlndex, Altered 
Record) 

If RetumCode = •fail- 
Abort Createlndex Procedure 

} 

Else 

Redo the second newly created audi! record against 
Newlndex 

} i 
Else / * UniquelndexFlag is False V 
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Redo the second newly created audit reord against Newlndex 
} /* end of processing for "table alteration" audit record 

Return 



10 Claims 

1. A computer system for storing and providing user access to data in stored database objects, comprising: 

memory for storing said database objects, wherein said memory resides in a plurality of interconnected com- 
puter nodes; 

a transaction manager, coupled to said memory, for managing computational transactions that add, delete 
and after data stored in said database objects; said transaction manager including audit trail generation in-„ 
structions for generating an audit trail and storing said audit trail in said memory, said audit trail including audit 
records, at least a subset of said audit records each denoting an event selected from the set consisting es-' 
sentially of addition, deletion and alteration of specified data in a specified one of said database objects; 
a restructuring procedure, coupled to said memory, for restructuring a specified one of said database objects, 
said restructuring procedure, including: 

first phase instructions for accessing data in ssid first object and storing corresponding data in a second object 
while allowing continued performance of computational transactions against said first object; 
second phase instructions, for execution afte: said first phase instructions, for accessing said audit records in 
said audit trail created by said transaction manager during execution of said first phase instructions and there- 
after until execution of said second phase ins-ructions is completed; said second phase instructions updating 
said data stored in said second object by reqoing with respect to said second object each event denoted by 
said accessed audit trail records; and : 

third phase instructions, for execution after said second phase instructions, for obtaining a lock on said first 
object so as to prevent continued performance of computational transactions against said first object, and for 
then accessing audit records in said audit trail created by said transact on manager after execution of said 
second phase instructions; said third phase instructions updating said data in said second object by redoing 
with respect to said second object each event denoted by said accessed audit trail records. 

The computer system of claim 1 , 

said database objects including database tables and database indexes, each database table having a plurality 
of columns, a plurality of records having primary key values based on a first specified subset of said columns, 
and a primary key index based on said primary key values; 

said restructuring procedure comprising a create index procedure for creating an alternate key index having 
a unique alternate key value for each record; 

said first object comprising one of said database tables and said second object comprising an alternate key 
index generated by said create index procedure; 

said first phase instructions including instructions for accessing said records in said one database table and 
instructions for generating and storing a corresponding index record in said alternate key index, each record 
in said alternate key index including said primary key value for said corresponding database table record and 
an alternate key value based on a second specified subset of said columns; 

said second phase and third phase instructions including instructions for deleting an index record in said 
alternate key index when one of said accessed audit records indicates an event making said alternate key 
index record invalid, and instructions for adding index records to said alternate key index when said accessed 
audit records indicate events that add new database records and events that alter any field of an existing 
database record corresponding to any column in said first and second specified subsets of columns; and 
said generating and storing instructions in said first phase instructions, and said adding index records instruc- 
tions in said second phase and third phase instructions including instructions for determining whether two 
records in said one database table have identical alternate key index values and for aborting said create index 
procedure when two such records are found. 
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3. A computer system for storing and providing user access to data in stored databases, comprising: 

memory for storing said databases, said databases including database tables, each database table having a 
plurality of columns, a primary key index based on a specified subset of said columns, and an associated 
s schema denoting information about said table's columns; wherein said memory resides in a plurality of inter- 

connected computer nodes; 

at least a subset of said database tables each including a plurality of partitions, each partition storing records 
having primary key values in a primary key range distinct from all others of said plurality of partitions; each 
partition stored as a separate data structure in said memory; 

w a transaction manager, coupled to said memory, for managing computational transactions that add, delete 

and alter records in said database tables; said transaction manager including audit trail generation instructions 
for generating an audit trail and storing said audit trail in said memory said audit trail including audit entries, 
each audit entry denoting a database table record event selected from the set consisting essentially of addition, 
deletion and alteration of a specified database table record in a specified one of said database tables; 

75 a move partition procedure, coupled to said memory, for moving any specified one of said partitions for one 

of said database tables having a plurality of partitions from a first memory location to a second memory location; 
said move partition procedure, including: ' 

first phase instructions for creating a new partition at said memory location and for copying all records of said 
specified partition into said new partition, using read only access to said records of said specified partition 

20 while performing said copying; 

second phase instructions, for execution after said first phase instructions, for accessing audit records in said 
audit trail created by said transaction manager during execution of said first phase instructions and thereafter 
until execution of said second phase instructions is completed; said second phase instructions updating said 
records in said new partition in so as to redo each database table record event denoted by said accessed 

25 audit trail records; and 

third phase instructions, for execution after said second phase instructions, for obtaining a lock on said one 
database table, then accessing audit records in said audit trail created by said transaction manager after 
execution of said second phase instructions; said third phase instructions updating said records in said new 
partition in so as to redo each database table record event denoted by said accessed audit trail records; said 

30 third phase instructions further including instructions for updating said schema information for said one data- 

base table and then releasing said lock on said one database.table. 
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Command: MovePartition(PartitionlD, NewDiskLocation) 

1 } ■ < ,-220 



Phase 1: "Dirty Copy" 

Create file for NewPartltion. 

Create file label for NewPartition. 

Save position of last record in Audit Trail as AuditPtr. 

While allowing user transactions to continue accessing the 
specified OldPartition, copy all records of the OldPartition 
Into the NewPartition without regard to the fact that records 
in the OldPartition may be in the process of being changed. 

i c222_ 

Phase 2: "Partial Cleanup" 

Set Filter for AuditTrail to access only audit trail records 
that pertain to the OldPartition. 

Starting with the audit record in the Audit Trail at the initial 
AuditPtr value, repeat the following steps for each 
sequential audit record that passes the Filter until the end 
of the Audit Trail is reached: 

Modify the audit record to refer to the NewPartition. 

If the change to the table noted in the audit record is 

already reflected in the copied records. 
Ignore the audit record 

Otherwise 

Perform a "redo" with respect to the modified audit 
record so as to apply the change noted in the audit 
record to the NewPartition. This includes updating 
the B-tree for the NewPartition as needed to 
perform the change. 

FIGURE 4A r 
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Phase 3: "Final Cleanup" 

Request and wait for lock on the OldPartition. 
As a unitary transaction: 
Repeat the following steps for each sequential audit 
record that passes the Filter until the last such audit 
record has been processed: 
Modify the audit record to refer to the NewPartition. 
If the change to the table noted in the audit record is 
already reflected in the copied records. 

Ignore the audit record 
Otherwise 

Perform a "redo" with respect to the modified audit 
record so as to apply the change noted in the audit 
record to the NewPartition. This includes updating 
the B-tree for the NewPartition as needed to 
perform the change. 

Request and wait for lock on the Base Table. 

Delete Catalog entry for OldPartition 

Create Catalog entry for NewPartition 

Update catalog entries and file labels for all other 

partitions of the Base Table. 

Release Lock on BaseTable. 

In Background, delete OldPartition and release disk 
space used by OldPartition file 

Conclude transaction. 



FIGURE 4B 
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Command: SplitPartition(OldPartition, OldRange, 
NewRangel, NewRange2, NewPartition) 

\ ^ 30 



Phase 1: "Dirty Copy" 

Create file for NewPartition. 
Create file label for NewPartition. 
' Save position of last record in Audit Trail as AuditPtr. 

While allowing user transactions to continue accessing the 
specified OldPartition, copy all records of the OldPartition 
having a Primary Key value in NewRange2 into the 
NewPartition without regard to the fact that records in the 
OldPartition may be in the process of being changed. 



I 



^232 



Phase 2: "Partial Cleanup" . 

Set Filter for AuditTrail to access only audit trail records 
that pertain to OldPartition records with a Primary key 
value in NewRange2. 

Starting with the audit record in the Audit Trail at the initial 
AuditPtr value, repeat the following steps for each 
sequential audit record that passes the Filter until the end 
of the Audit Trail is reached: 

Modify the audit record to refer to the NewPartition. 

If the change to the table noted in the audit record is 

already reflected in the copied records. 
Ignore the modified audit record 

Otherwise 

Perform a "redo" with respect to* the modified 
audit record so as to apply the change noted in 
the audit record to the NewPartiaon. This 
includes updating the B-tree for/the 
NewPartition as needed to perform the change. 
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Phase 3: "Final Cleanup" 

Request and wait for lock on the OldPartition. 
As a unitary transaction: 

Read each audit record in the Audit Trail that references 
the OldPartition and repeat the following steps for each 
sequential audit record that passes the Filter until the 
last such audit record has been processed: 

Modify the audit record to refer to the NewPartition. 
If the change to the table noted in the audit record 
is already reflected in the copied records. 

Ignore the audit record 
Otherwise 

Perform a "redo" with respect to the modified 
audit record so as to apply the change noted in 
the audit record to the NewPartition. This 
includes updating the B-tree for the 
NewPartition as needed to perform the change. 

Split B-tree for OldPartition into two parts, one for 
records in NewRangel and one for records in 
NewRange2. 

Make NewRange2 in Old Part ion inaccessible. 
Request and wait for lock on the Base Table. 
Delete Catalog entry for OldPartition 
Create Catalog entry for NewPartition 
Update catalog entries and file labels for all other 
partitions of the Base Tattle. 
Release Lock on BaseTable. 
Conclude transaction. 

In Background, delete all records in OldPartition 
referenced by B-tree for records in NewRange2 



FIGURE 5B 
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Command: MovePartitionBoundary(Partition1, OldRangel, 
NewRangel, Partition2, OldRange2; NewRange2) 

i ^AO 

Phase 1: "Dirty Copy" 

Save position of last record in Audit Trail as AuditPtr. 
MoveRange = range of Primary Key values In NewRange2 
that are not in OldRange2. 

While allowing user transactions to continue accessing the 
specified Partitionl, copy all records of Partitionl having a 
Primary Key value in the MoveRange into Partition2 without 
regard to the fact that records in Partitionl may be in the 
process of being changed. ____ 

Phase 2: "Partial Cleanup" 

Set Filter for AuditTrail to access only audit trail records 
that pertain to the Partitionl records with a Primary key 
value in MoveRange. 

Starting with the audit record in the Audit Trail at the initial 
AuditPtr value, repeat the following steps for each 
sequential audit record that passes the Filter until the end 
of the Audit Trail is reached: 

Modify the audit record to refer to Partition^ 

If the change to the table noted In the audit record 

is already reflected in the copied records. 
Ignore the modified audit record 

Otherwise 

Perform a "redo" with respect to the modified audit 
record so as to apply the change noted in the audit 
record to Partition2. This includes updating the 
B-tree for Partition2 as needed to perform the 
change. 




FIGURE 6A 
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Phase 3: "Final Cleanup" 

Request and wait for lock on the OldPartition. 
As a unitary transaction: 
Read each audit record in the Audit Trail and repeat the 
following steps for each sequential audit record that 
passes the Filter until the last such audit record has 
been processed: 

Modify the audit record to refer to the Partition2. 
If the change to the table noted in the audit record 
is already reflected in the copied records. 

Ignore the audit record 
Otherwise 

Perform a "redo" with respect to the modified 
audit record so as to apply the change noted in 
the audit record to the Partition2. This includes 
updating the B-tree for Partition2 as needed to 
perform the change. 

Split B-tree for Partitionl into two parts, one for records 
in NewRangel and one for records in MoveRange. 
Make MoveRange in Partitionl inaccessible. 
Request and wait for lock on the Base Table. 
Update catalog entries and file labels for Partitionl and 
Partitions 

Update catalog entries and file labels for all other 
partitions of the Base Table. 
Release Lock on BaseTable. 
Conclude transaction. 

In Background, delete all records in Partitionl 
referenced by B-tree for records in MoveRange 



FIGURE 6B 



FIGURE 6A 



FIGURE 6B 
FIGURE 6 



39 



EP 0 723 238 A1 



Command: Createlndex(BaseTable, AltKeyColumns, 
UniquelndexFlag, Newlndex) 



Phase 1: "Dirty Copy" 

Save position of last record in Audit Trail as AuditPtr. 

While allowing user transactions to continue accessing the specified 
BaseTable, create Newlndex record for each record in the BaseTable: 




Create Newlndex Record 251 
AltKeyl = Create_AltKey(Record, AltKeyColumns) 
PrimaryKeyl = Create_PrimaryKey(Record. PrimarytKeyColumns) 
If UniquelndexFlag 
Prior to storing each Newlndex record, check to see if Newlndex 
already has a record with an alternate index value of AltKeyl . 
If one is found 

Check to see if (A) the record still has index values of 
AltKeyl and PrimaryKeyl, and (B) if there is another record 
in the BaseTable having an alternate key value of AltKeyl. 
If both A and B are true 

Abort Createlndex - conflicting database records found 
Else 

Store new index record in Newlndex 

Else 

Store new index record in Newlndex 
Else r UniquelndexFlag = False 7 
Store new index record in Newlndex 





FIGURE 7A 
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Phase 2: "Partial Cleanup" 

Set Filter for AuditTrail to access only audit trail records 
that pertain to the BaseTable. 

Starting with the audit record in the Audit Trail at the initial 
AuditPtr value, repeat the following steps for each 
sequential audit record that passes the Filter until the end 
of the Audit Trail is reached: 

Call Audit Fixup Procedure for C reate Index ^53 

Audit Fixup Procedure for Create Index 

If the audit record denotes a change to a base table record 
that does not alter either the Primary Key or the AltKey of 
the record, or is already reflected in the Newlndex 

Ignore the audit record 
Elseif the audit record denotes deletion of a base table 
record 

Create an Audit Record denoting deletion of associated 

Newlndex record, if one exists 
Elseif the audit record denotes addition of a base table record 

Create an Audit Record denoting creation of Newlndex 

Record (using same procedure as in Phase 1) 
Else /* audit record denotes alteration of a base table record 7 

Create an Audit Record denoting deletion of the 

Newlndex record, if one exists, for altered record's prior 

alternate key value 

Create an Audit Record denoting creation of a Newlndex 
Record for altered record 

Perform a "redo* of the created audit record(s) so as to apply 
changes to the index that are needed for the change to the 
BaseTable, as recorded by the audit record. Update the 
B-tree for the index as needed to perform the changes. 




FIGURE 7B 
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Phase 3: "Final Cleanup" 

Request and wait for lock on the Base Table. 

As a unitary transaction: 
Read each audit record in the Audit Trail, starting with 
the first record created after the end of Phase 2, and 
repeat the following steps for each sequential audit 
record that passes the Filter until the last such audit 
record has been processed: 
Call Audit Fixup Procedure for Create Index 

Create Catalog entry for Newlndex 

Update file labels for all partitions of the Base Table. 

Release Lock on BaseTable. 

Conclude transaction. 



FIGURE 7C 
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